Length | Sentence |
---|---|
15 | Ekuhle Mariya.. |
15 | Khumbula lokhu. |
15 | Khona uHerode,. |
15 | Izwe lami nawe. |
15 | Buka isithombe. |
16 | Umthetho mkhulu. |
16 | 3. Genesise 3:1. |
16 | Nguye onamandla. |
16 | Hayi bahambe ke. |
17 | Ezinye izindaba.. |
Length | Sentence |
---|---|
15 | UJesu uyaphila! |
17 | Buka esithombeni! |
22 | Kuqaphele konke lokhu! |
22 | Yeka ukuwa kwamaqhawe! |
23 | Kadunyiswe uNkulunkulu! |
23 | Kade babe hamba bebuye! |
24 | ILizwi lokuphila- uJesu! |
24 | Lokhu kumele kube nzima! |
24 | Kasingeneni esifundweni! |
24 | Impendulo yami ithi QHA! |
Length | Sentence |
---|---|
16 | Ngabe yini leyo? |
17 | 2. Impumela yini? |
17 | 1. Siqala ngaphi? |
18 | G. Mngane wena ke? |
18 | Okubi noma okuhle? |
18 | Loba utsho okunye? |
19 | Kulungile na lokhu? |
19 | 1. Mngane, wena ke? |
19 | D. Mngane, wena ke? |
19 | E. Mngane, wena ke? |
Here we see the absolutely shortest sentences in the corpus. In three tables we find declarative, exclamatory and interrogative sentences.
The sentences give some insight into the language or the corpus. Moreover, in the case of malformed sentences they may give hints for better preprocessing.
We find only sentences which were accepted by the preprocessing. For language detection, usually a minimum number of known words is necessary. Because of this, some very short sentences may be missing in the corpus.
select char_length(sentence) as le, sentence from sentences where sentence like "%!" and 40>length(sentence) order by le limit 15;
4.1.2 Sentences of fixed length I
4.1.3 Sentences of fixed length II
4.1.4 Sentences of fixed length III
4.1.5 Longest sentences